1 THE WRAPPER APPROACHRon Kohavi

نویسنده

  • George H. John
چکیده

In the feature subset selection problem, a learning algorithm is faced with the problem of selecting a relevant subset of features upon which to focus its attention, while ignoring the rest. To achieve the best possible performance with a particular learning algorithm on a particular training set, a feature subset selection method should consider how the algorithm and the training set interact. We explore the relation between optimal feature subset selection and relevance. The wrapper method searches for an optimal feature subset tailored to a particular algorithm and a domain. We compare the wrapper approach to induction without feature subset selection and to Relief, a lter approach to feature subset selection. Improvement in accuracy is achieved for some datasets for the two families of induction algorithms used: decision trees and Naive-Bayes. In addition, the feature subsets selected by the wrapper are signiicantly smaller than the original subsets used by the learning algorithms, thus producing more comprehensible models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Aij Special Issue on Relevance Wrappers for Feature Subset Selection

In the feature subset selection problem, a learning algorithm is faced with the problem of selecting a relevant subset of features upon which to focus its attention, while ignoring the rest. To achieve the best possible performance with a particular learning algorithm on a particular training set, a feature subset selection method should consider how the algorithm and the training set interact....

متن کامل

Wrappers for Feature Subset Selection

In the feature subset selection problem a learning algorithm is faced with the problem of selecting a relevant subset of features upon which to focus its attention while ignoring the rest To achieve the best possible performance with a particular learning algorithm on a particular training set a feature subset selection method should consider how the algorithm and the training set interact We e...

متن کامل

Feature Subset Selection Using the Wrapper Method: Overfitting and Dynamic Search Space Topology

In the wrapper approach to feature subset selection, a search for an optimal set of features is made using the induction algorithm as a black box. The estimated future performance of the algorithm is the heuristic guiding the search. Statistical methods for feature subset selection including forward selection, backward elimination, and their stepwise variants can be viewed as simple hill-climbi...

متن کامل

Appears in the First International Conference on Knowledge Discovery and Data Mining (KDD-95) Feature Subset Selection Using the Wrapper Method: Over tting and Dynamic Search Space Topology

In the wrapper approach to feature subset selection, a search for an optimal set of features is made using the induction algorithm as a black box. The estimated future performance of the algorithm is the heuristic guiding the search. Statistical methods for feature subset selection including forward selection, backward elimination, and their stepwise variants can be viewed as simple hill-climbi...

متن کامل

Subset Selection Using the Wrapper Method : Over tting and Dynamic Search Space

In the wrapper approach to feature subset selection, a search for an optimal set of features is made using the induction algorithm as a black box. The estimated future performance of the algorithm is the heuristic guiding the search. Statistical methods for feature subset selection including forward selection, backward elimination, and their stepwise variants can be viewed as simple hill-climbi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998